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Abstract 

We consider the statistical analysis of random sections of a 
spin fibre bundle over the sphere. These may be thought of as 
random fields that at each point p 6 § 2 take as a value a curve 
(e.g. an ellipse) living in the tangent plane at that point T p § 2 , 
rather than a number as in ordinary situations. The analysis of 
such fields is strongly motivated by applications, for instance po- 
larization experiments in Cosmology. To investigate such fields, 
spin needlets were recently introduced by [21] and [20]. We con- 
sider the use of spin needlets for spin angular power spectrum 
estimation, in the presence of noise and missing observations, 
and we provide Central Limit Theorem results, in the high fre- 
quency sense; we discuss also tests for bias and asymmetries with 
an asymptotic justification. 
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1 Introduction 

The analysis of (random or deterministic) functions denned on the sphere 
by means of wavelets has recently been the object of a number of the- 
oretical and applied papers, see for instance [3], [4], [5, 61], [41, 42], 
[22, 23, 24], [7, 8], [27]. Many of these works have found their motivating 
rationale in recent developments in the applied sciences, such as Medical 
Imaging, Geophysics, Atmospheric Sciences, Astrophysics and Cosmol- 
ogy. These same fields of applications are now prompting stochastic 
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models which are more sophisticated (and more intriguing) than ordi- 
nary, scalar valued random fields. In this paper, we shall be especially 
concerned with astrophysical and cosmological applications, but several 
similar issues can be found in other disciplines, see for instance [55] for 
related mathematical models in the field of brain mapping. 

Concerning astrophysics, there are now many mathematical papers 
which have been motivated by the analysis of so-called Cosmic Mi- 
crowave Background radiation (CMB); the latter can be very loosely 
viewed as a relic electromagnetic radiation which permeates the Universe 
providing a map of its status from 13.7 billions ago, in the immediate 
adjacency of the Big Bang. Almost all mathematical statistics papers in 
this area have been concerned with the temperature component of CMB, 
which can be represented as a standard spherical random field (see [13] 
for a review). We recall that a scalar random field on the sphere may 
be thought of as a collection of random variables {T(p) : p 6 § 2 }, where 
S 2 = [p : ||p|| 2 = 1} is the unit sphere of R 3 and |.| denotes Euclidean 
norm. T{p) is isotropic if its law is invariant with respect to the group 

of rotations, T(p) = T(gp) for all g G 5*0(3), where = denotes equality 
in distribution of random fields and SO (3) can be realized as the set of 
ortho normal 3x3 matrices with unit determinant. 

However, most recent and forthcoming experiments (such as Planck, 
which was launched on May 14, 2009, the CLOVER, QUIET and QUAD 
experiments or the projected mission CMBPOL) are focussing on a much 
more elusive and sophisticated feature, i.e. the so-called polarization of 
CMB. The physical significance of the latter is explained for instance in 
[12, 31, 56]; we do not enter into these motivations here, but we do stress 
how the analysis of this feature is expected to provide extremely reward- 
ing physical information. Just to provide a striking example, detection 
of a non-zero angular power spectrum for the so-called 5-modes of po- 
larization data (to be defined later) would provide the first experimental 
evidence of primordial gravitational waves; this would result in an im- 
pressive window into the General Relativity picture of the primordial Big 
Bang dynamics and as such it is certainly one of the most interesting 
perspectives of current physical research. Polarization is also crucial in 
the understanding of the so-called reionization optical depth, for which 
very little information is available from temperature data, see [20] for 
more discussion on details. 

Here, however, we shall not go deeper into these physical perspec- 
tives, as we prefer to focus instead on the new mathematical ideas which 
are forced in by the analysis of these datasets. A rigorous understand- 
ing requires some technicalities which are postponed to the next Section; 
however we hope to convey the general idea as follows. We can imagine 
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that experiments recording CMB radiation are measuring on each direc- 
tion p G § 2 a random ellipse living on T p § 2 , the tangent plane at that 
point. The "magnitude" of this ellipse (= c 2 = a 2 + b 2 in standard ellipse 
notation), which is a standard random variable, corresponds to temper- 
ature data, on which the mathematical statistics literature has so far 
concentrated. The other identifying features of this ellipse (elongation 
and orientation) are collected in polarization data, which can be thought 
of as a random field taking values in a space of algebraic curves. In more 
formal terms (to be explained later), this can be summarized by saying 
that we shall be concerned with random sections of fibre bundles over 
the sphere; from a more group-theoretic point of view, we shall show 
that polarization random fields are related to so-called spin-weighted 
representations of the group of rotations SO (3). A further mathemati- 
cal interpretation, which is entirely equivalent but shall not be pursued 
here, is to view these data as realizations of random matrix fields (see 
again [55]). Quite interestingly, there are other, unrelated situations in 
physics where the mathematical and statistical formalism turns out to 
be identical. In particular gravitational lensing data, which have cur- 
rently drawn much interest in Astrophysics and will certainly make up a 
core issue for research in the next two decades, can be shown to have the 
same (spin 2, see below) mathematical structure, see for instance ([10]). 
More generally, similar issues may arise when dealing with random de- 
formations of shapes, as dealt with for instance by ([2]). 

The construction of a wavelet system for spin functions was first ad- 
dressed in [21]; the idea in that paper is to develop the needlet approach 
of [41, 42] and [22, 23, 24] to this new, broader geometrical setting, 
and investigate the stochastic properties of the resulting spin needlet 
coefficients, thus generalizing results from [7, 8]. A wide range of pos- 
sible applications to the analysis of polarization data is discussed in 
[20]. Here, we shall focus in particular on the possibility of using spin 
needlets for angular power spectrum estimation for spin fields, an idea 
that for the scalar case was suggested by [7]; in [48], needlets were used 
for the estimation of cross-angular power spectra of CMB and Large 
Scale Structure data, in [18], [19] the estimator was considered for CMB 
temperature data in the presence of faint noise and gaps, while in [49] 
the procedure was implemented on disjoint subsets of the sphere as a 
probe of asymmetries in CMB radiation. 

The plan of this paper is as follows: in Section 2 we present the 
motivations for our analysis, i.e. some minimal physical background on 
polarization. In Section 3 and 4 we introduce the geometrical formal- 
ism on spin line bundles and spin needlets, respectively, and we define 
spin random fields. Sections 5, 6 and 7 are devoted to the spin needlets 
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spectral estimator and the derivation of its asymptotic properties in the 
presence of missing observations and noise, including related statistical 
tests for bias and asymmetries. Throughout this paper, given two pos- 
itive sequences {a? },{&?} we shall write a,j ~ bj if there exist positive 
constants Ci, C2 such that c\dj < bj < C2<2 j for all j > 1. 

2 Motivations 

The classical theory of electromagnetic radiation entails a characteriza- 
tion in terms of the so-called Stokes' parameters Q and U, which are 
defined as follows. An electromagnetic wave propagating in the z direc- 
tion has components 

E x (z, t) = E 0x cos(r + 5 X ) , E y (z, t) = E 0y cos(r + 8 y ) , (1) 

where r := cut — kz is the so-called propagator and v = 2-nuj/k is the 
frequency of the wave. (1) can be viewed as the parametric equations 
of an ellipse which is the projection of the incoming radiation on the 
plane perpendicular to the direction of motion. Indeed, some elementary 
algebra yields 



The magnitude of the ellipse (i.e., the sums of the squares of its semi- 
major and semiminor axes) is given by 



T has the nature of a scalar quantity, that is to say, it is readily seen 
to be invariant under rotation of the coordinate and y. It can 

hence be viewed as an intrinsic quantity measuring the total intensity of 
radiation; from the physical point of view, this is exactly the nature of 
CMB temperature observations which have been the focus of so much 
research over the last decade. It should be noted that, despite the non- 
negativity constraint, in the physical literature on CMB experiments T 
is usually taken to be Gaussian around its mean, in excellent agreement 
with observations. This apparent paradox is explained by the fact that 
the variance of T is several orders of magnitude smaller than its mean, 
so the Gaussian approximation is justifiable. 

The characterization of the polarization ellipse is completed by in- 
troducing Stokes' parameters Q and U, which are defined as 



E 2 x (z,t) , El 



(z,t) 2 E x (z,t) E y (z,t) 



cos 5 = sin 2 5 , 5 := 5 y — 5, 




Q = E 2 0x - E 2 Qy , U = 2E 0x E 0y cos 5 . 



(2) 
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To provide a flavour of their geometrical meaning, we recall from ele- 
mentary geometry that the parametric equations of a circle are obtained 
from (1) in the special case Eq x = Eq v , 5 x = 5 y + tt/2, whence the circle 
corresponds to Q — U = 0. On the other hand, it is not difficult to 
see that a segment aligned on the x axis is characterized by Q = T, a 
segment aligned on the y axis by Q = —T, for a segment on the line 
y = ±x we have 5 X — 5 y = 0, ir, and hence Q — 0, U — ±T, respectively. 
The key feature to note, however, is the following: while T does not de- 
pend on any choice of coordinates, this is not the case for Q and U, i.e. 
the latter are not geometrically intrinsic quantities. However, as these 
parameters identify an ellipse, it is natural to expect that they will be 
invariant under rotations by 180° degrees and multiples thereof. This 
is the first step in understanding the introduction of spin random fields 
below. 

Indeed, it is convenient to identify M 2 with the complex plane C 
by focussing on w = x + iy; a change of coordinates corresponding to a 
rotation 7 can then be expressed as w' = exp(ry)tu, and some elementary 
algebra shows that the induced transform on (Q, U) can be written as 

(Q'\ = ( cos 2 7 sin 2 7 \ (Q'\ 
\U'J V -sin 2 7 cos 2 7 j \U'J ' 

or more compactly 

Q' + iU' = exp(i2 7 )(Q + iC/) . (3) 

In the physicists' terminology, (3) identifies the Stokes' parameters as 
spin 2 objects, that is, a rotation by an angle 7 changes their value 
by exp(i27). As mentioned before, this can be intuitively visualized by 
focussing on an ellipse, which is clearly invariant by rotations of 180°. 
To compare with other situations, standard (scalar) random fields are 
clearly invariant (or better covariant) with respect to the choice of any 
coordinate axes in the local tangent plane, and as such they are spin zero 
fields; a vector field is spin 1, while we can envisage random fields taking 
values in higher order algebraic curves and thus having any integer spin 
s > 2. 

As mentioned earlier, it is very important to notice that polariza- 
tion is not the only possible motivation for the analysis of spin random 
fields. For instance, an identical formalism is derived when dealing with 
gravitational lensing, i.e. the deformation of images induced by gravity 
according to Einstein's laws. Gravitational lensing is now the object of 
very detailed experimental studies, which have led to huge challenges on 
the most appropriate statistical methods to be adopted (see for instance 
[10]). We defer to future work a discussion on the statistical procedures 
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which are made possible by the application of spin needlets to lensing 
data. 

3 Geometric Background 

In this Section, we will provide a more rigorous background on spin 
functions. Despite the fact that our motivating applications are limited 
to the case s = 2, we will discuss here the case of a general integer s G Z, 
which does not entail any extra difficulty. 

A more rigorous point of view requires some background in Differ- 
ential Geometry, for which we refer for instance to [9] and [1]. The 
construction of spin functions is discussed in more detail by [21], which 
builds upon a well-established physical literature described for instance 
in [43], [56], [12]. 

Say p G § 2 . We recall first the tangent plane T p §> 2 , which is defined as 
usual as the linear space generated by the collection of tangent vectors 
at p. To proceed further to spin random fields, we need to recall from 
Geometry the notion of a fibre bundle. The latter consists of the family 
(E,B,n,F), where E, B, and F are topological spaces and n : E — > B 
is a continuous surjection satisfying a local triviality condition outlined 
below. The space B is called the base space of the bundle, E the total 
space, and F the fibre; the map n is called the projection map (or bundle 
projection). In our case, the base space is simply the unit sphere B = 8 2 ; 
it is tempting to view the fibres as ellipses (or vectors, for s — 1, or more 
general algebraic curves, for s > 3) lying in T p §> 2 , however one must 
bear in mind that to characterize the ellipse we would need to focus 
jointly on (T, Q, U), while our analysis below is restricted to the Stokes' 
parameters Q and U. 

The basic intuition behind fibre bundles is that they behave locally as 
simple Cartesian products B x F. The former intuition is implemented 
by requiring that for all p G S 2 there exist a neighbourhood U — U{p) 
such that 7r _1 (i7) is homeomorphic to U x F, in such a way that n carries 
over to the projection onto the first factor. In particular, the following 
diagram should commute: 

n-\U)y±+U x F 

proj 

u 

where is a homeomorphism and proj is the natural projection. The 
set n~ 1 (x) is homeomorphic to F and is called the fibre over x. The fibre 
bundles we shall consider are smooth, that is, E, B, and F are required 
to be smooth manifolds and all the projections above are required to be 
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smooth maps. 

In our case, we shall be dealing with a complex line bundle which 
is uniquely identified by fixing transition functions to express the trans- 
formation laws under changes of coordinates. Following ([21]) (see also 
[28, 43]), we define Ui := § 2 \ {N, 5} to be the chart covering the sphere 
with the exception of the North and South Poles, with the usual coor- 
dinates (i?, tp). We define also the rotated charts Ur = RUj\ in this new 
charts, we will use the natural coordinates (i?r, ip R ). At each point p of 
Ur, we take as a reference direction in the tangent plane T P S 2 , the tan- 
gent vector d/difR, (which points in the direction of increasing ifR and 
is tangent to a circle 9 R = constant). Again as in [21], we let let ^> p r 2 r 1 
be the (oriented) angle from d/difR 1 to d/dipR 2 (for a careful discussion 
of which is the oriented angle, see [21]); this angle is independent of 
any choice of coordinates. We define a complex line bundle on 5 2 by 
letting exp(isip P R 2 R 1 ) be the transition function from the chart Ur 1 to 
Ur 2 . A smooth spin function / is a smooth section of this line bundle. 
/ may simply be thought of as a collection of complex-valued smooth 
functions (/ij)_Rg5o(3) 5 with f R defined and smooth on Ur„ such that for 
all Ri, i?2 £ 5*0(3), we have 

fih(p) = exp(isilj p R 2Rl )f Rl (p) 

for all p in the intersection of Ur 1 and Ur 2 . 

An alternative, group theoretic point of view can be motivated as 
follows. Consider the group of rotations 5*0(3); it is a well-known that, 
by elementary geometry, each element g can be expressed as 

g = R z (a)R x (P)Rz(l) , < a < tt , < (3, 7 < 2tt , (4) 

where R z {.) and R x (.) represent rotations around the z and x axis, re- 
spectively; in words, (4) is stating that each rotation can be realized by 
rotating first by an angle 7 around the z axis, then by an angle f3 around 
the x axis, then again by an angle a around the z axis. We denote as 
usual by {D l ( )} l=Q12 the Wigner family of irreducible matrix repre- 
sentations for 50(3); in terms of the Euler angles, the elements of these 
matrices can be expressed as 

KnmM =exp(-i mi a)d l mim2 (p)exp(-im 2 -f) . 

Note that D l mim2 (g) = {-l) mi ~ m2 D l _ mi ^ ni2 {g). Standard results from 
group representation theory ([17, 58, 59]) yield 

m 2 
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and 

dg denoting the standard uniform (Haar) metric on SO (3). The ele- 
ments of |-D'(.)} / _ 1 2 thus make up an orthogonal system which is 
also complete, i.e., it is a consequence of the Peter- Weyl theorem ([17]) 
that all square integrable functions on 5*0(3) can be expanded, in the 
mean square sense, as 

/(#) = EE ^r hl mim2 Dl mim2 {9) , 

l mim2 

where the coefficients {b l mim2 } can be recovered from the inverse Fourier 
transform 

b L im2 = / f{g)D l mim2 {g)dg . 

JSO(3) 

By elementary geometry, we can view the unit sphere as the quotient 
space § 2 = SO(3)/SO(2) and the functions on the sphere as those which 
are constants with respect to the third Euler angle 7, i.e. /(a, /3, 7) = 
f(a,(3, 7') for all 7,7'. It follows that 



Isc 



f{g)D l mim2 {g)dg 

! SO(3) 

"2"7T P7T p27T 



= (-1)"* 1^ r I W f(g) exp(tm ia )d l _ mi _ m2 ((3)exp(tm 2l ) sin ^dad(3d 7 
Jo Jo Jo 



for m 2 7^ 

In view of the well-known identity 



2nb l mi0 otherwise 



21 + 1 4 



Y lm ((3 : a) = \/^—d l m0 (Py 



(where we have used d l mn (/3) = (— l) m ~ n d l _ m _ n (/3), see [58], equation 
4.4.1), we immediately obtain the expansion of functions on the sphere 
into spherical harmonics, i.e. 



f(p) = -^- bl rno D Lo(p) = J2 aimYim ^ ' aim = y~^r b 



m Im 



-mO 
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We can hence loosely say that standard scalar functions on the sphere 
"live in the space generated by the column s = of the Wigner's D 
matrices of irreducible representations", see also [37]. Now from the 
Peter- Weyl Theorem we know that each of the columns s = —I,..., I 
spans a space of irreducible representations, and these spaces are mutu- 
ally orthogonal; it is then a natural, naive question to ask what is the 
physical significance of these further spaces. It turns out that these are 
strictly related to spin functions, indeed we can expand the fibre bundle 
of spin s functions as 



2/ -l- 1 



(5) 



Spin s functions can then be related to the so-called spin weighted rep- 
resentations of 5*0(3), see for instance [11]. Now by standard group 
representation properties we have that 

2/ 4- 1 

f s ((R z (i)p) = ££17^(^(71)4(^,7) 

I m 

^ ^ 2/ + 1 

= Yl Yl ~^~ blms ex P( is 7)^L(^, 7) 

l m 

= esxp(isj)f a {p) , 

as expected. 

The analogy with the scalar case can actually be pursued further 
than that. It is well-known that the elements D l mQ , m = — /, ...,/ of the 
Wigner's D matrices are proportional to the spherical harmonics Yi m , 
i.e. the eigenfunctions of the spherical Laplacian operator 
As 2 ^5m = —1(1 + l)Yi m . It turns out that this equivalence holds in much 
greater generality and for all integer s and I > s there exist a differ- 
ential operator 55 such that — 55-D^ s = ei s D l ms , where {ei s } l=ss+1 = 
{(I — s)(l + s + 1)} S s+1 is the associated sequence of eigenvalues (note 
that for s = we are back to the usual expressions for the scalar case, 
as expected). The operators 5, 5 are defined as follows, in terms of their 
action on any spin s function / s (.), 



5/ s (tf )V ?) = -(sin^ 
9/ a (0,<p) = -(sin0)- 



d 



+ 



d 



8$ sin i? d(p 
d id 



dd sin dip 



Bm0)-f,(0,<p) , (6) 
(sin 0)' . (7) 



In (6) one should more rigorously write (9/ s )/ on the left side and 
(f s )i on the right side. In fact, if on the right side of (6) we replace 
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(■&, tp) by (& R , tp R ) and f s by (f s ) R , the result is in fact (3/ s )h(#ji, tp R ) 
(see [21]); similarly in (7). 

The spin s spherical harmonics can then be identified as 

Y lms (^<P) = (-l) m J^DL ms (^$,-^)eM-^) 

r — (°) 
= (-l) m ^/^exp(tmp)d l _ ms ^) ; 

again, the previous expression should be understood as Yz ms; /($, tp), i.e. 
spin spherical harmonics are clearly affected by coordinate transforma- 
tion, but we drop the reference to the choice of chart for ease of notation 
whenever this can be done without the risk of confusion. The spin spher- 
ical harmonics can be shown to satisfy 

Y lm>a+1 = [(l-s)(l + s + 1)]- 1/2 m m ,s , (9) 



= -[(i + s)(i-s + i)y i/i m m , s , (io) 

which motivates the name of spin raising and spin lowering operators 
for 5, 9. Iterating, it can be shown also that (see [21]) 

Y lms = {|^y|} ' , for s > , 

Yi ms = {jT^J i-d)- S Yi m , for s < . 

Further properties of the spin spherical harmonics follow easily from 
their proportionality to elements of Wigner's D matrices; indeed we 
have (orthonormality) 

/ Y lms (p)Y^7s(p)dp= r r Y lms ($, v)Yi>m>($, V) smtidVdip = 6\' 5™' 
Js 2 Jo Jo 

Viewing spin-spherical harmonics as functions on the group SO (3) (i.e. 
identifying p = (i?, tp) as the corresponding rotation by means of Euler 
angles), using (8) and the group addition properties we obtain easily, for 
p,p' G S 2 , that 



^2 Ylms (?) Ylms (p') 



m=— I 



21 + 1 

An 



D -ms(V, 0)D l _ ms (tp', 0) 



^D l _ s M(p,p')) , 
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where ip(p,p') denotes the composition of the two rotations (explicit 
formulae can be found in [58]). In the special case p = p' and R — R', 
we have immediately 

j o; I i 

J2 Yi ms (p) Y lms (p) = , (11) 

m=— I 

see also [21] for an alternative proof. 

By combining (5) and (8) the spectral representation of spin functions 
is derived: 

f a (&, V) = J2J2 a i;msYi ;ma (&, <p) ■ (12) 

/ m 

From (12), a further, extremely important characterization of spin func- 
tions was first introduced by [43], see also and [21] for a more mathemat- 
ically oriented treatment. In particular, it can be shown that there exists 
a scalar complex-valued function g(-&, cp) = Re {g} (■&, ip)+ilm {g} (•&, ip), 
such that, such that 

= ^2 a lrn;EYlms{$, <P) + % ^2 a lrn;BY lms ('&, (f) , (13) 
Im Im 

where 

M#,<p) = (dyRe{ 9 }($, v ) , f B (#,<p) = (m^ig} ■ 

Note that a i;ms = a im . E + ia lm . B , where a im . E = ai- m . E , a lm . B = a^ m . B . 
It is also readily seen that 



ai;ms + 0>l;-ms ~ &lm;E + idlm;B + dlm;E ~ idlm;B ~ ^0-lm;E , 
0>l;ms — Cll-- m s = dlm;E + idlm;B ~ dlm;E + iaim;B = ^idlm;B ■ 

In the cosmological literature, {ai m . B } and {ai m - B } are labelled the 
E and B modes (or the electric and magnetic components) of CMB 
polarization. 

4 Spin Needlets and Spin Random Fields 

We are now in the position to recall the construction of spin needlets, as 
provided by [21]. We start by reviewing a few basic facts about standard 
(scalar) needlets. Needlets have been defined by [41, 42] as 

1>ik(p) = V%~kY, b (^j) £*W(p)>W(&*) >P e§2 > ( 14 ) 

I ^ ' m=-l 
11 



where {£jk, ^jk} are a set of cubature points and weights ensuring that 



b(.) is a compactly supported, C°° function, and B > 1 is a user-chosen 
"bandwidth" parameter. The general cases of non-compactly supported 
functions b(.) and more abstract manifolds than the sphere were studied 
by [22, 23, 24]. The stochastic properties of needlet coefficients and their 
use for the analysis of spherical random fields were first investigated 
by [7, 8], see also [32, 38, 33, 18] for further developments. Several 
applications have already been provided to CMB data analysis, see for 
instance [48, 36, 14, 19, 49, 50, 53, 51, 54]. 

For a fixed B > 1, we shall denote by {Xj}JL Q the nested sequence of 
cubature points corresponding to the space /C^+i], where [.] represents 
as usual integer part and KL L = ®i =0 Hi is the space spanned by spherical 
harmonics up to order L. It is known that {Xj}^L can be taken such that 
the cubature points for each j are almost uniformly €j — distributed with 
€j := nB^i , the coefficients {\jk} are such that cB~ 2 i < \jk < c'B" 2 ^ , 
where c, d are finite real numbers, and card {Xj} pa B 2 K Exact cubature 
points can be defined for the spin as for the scalar case, see [6] for details; 
for practical CMB data analysis, these cubature points can be identified 
with the centre pixels provided [29], with only a minor approximation. 

Spin needlet s are then defined as (see [21]) 



As before, {\jk,£,jk\ are cubature points and weights, b(-) G C°° is 
nonnegative, and has a compact support in [1/B,B]. The expression 
(15) bears an obvious resemblance with (14), but it is also important to 
point out some crucial differences. Firstly, we note that the square root 
of the eigenvalues ^/ef s has replaced the previous /. This formulation is 
instrumental for the derivation of the main properties of spin needlets 
by means of differential arguments in ([21]); we stress, however, that this 
is actually a minor difference, as all our results are asymptotic and of 
course 



A much more important feature is as follows: (15) cannot be viewed 
as a well-defined scalar or spin function, because Yi. ms (p) , Yi ]ms are 





(15) 
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spin(s and — s) functions defined on different point of S 2 , and as such 
they cannot be multiplied in any meaningful way (their product depends 
on the local choice of coordinates). Hence, (15) should be written more 
rigorously as 

^jk;s (p) = V^jk~Y^ b ( "^T j Y i Yl > ms ^ Yl ' ms ' 

l ^ ' m=-l 

^jk;s (p) = V\jkJ2t> ( "f^T ) Y {Y^(P) ® Y l\™ (&k)} , 
I ^ ' m=-l 

where we denoted by ® the tensor product of spin functions; spin needlets 
can the be viewed as spin {— s, s} operators (written T_ SjS ), which act 
on a space of spin s functions square integrable functions to produce a 
sequence of spin s square-summable coefficients, i.e. T_ S)S : L 2 S — > <? s . 
This action is actually an isometry, as a consequence of the tight frame 
property, see [6] and [25]. 

For any spin s function f s , the spin needlet transform is defined by 

/ fs(p)^jk;s(p) d P = Pjk;s , 

Js 2 

and the same inversion property holds as for standard needlets, i.e. 

jk 

the equality holding in the L 2 sense. The coefficients of spin needlets 
can be written explicitly as 

Pjk;s= I fs(p)^jkM d P=V^Y b \~^) Y a hmsYl- ms (£,-*) . 
^ S2 I ^ ' m=-l 

(16) 

Remark 1 To illustrate the meaning of these projection operations, and 
using a notation closer to the physical literature, we could view spin s 
quantities as "bra" entities, i.e. write (T(p) , (f3jk-, s , an d spin —s as 
"ket" quantities, i.e. write for instance Yi- ms (p)) . Then we would obtain 

I fs(p)^jk;2(p)dp = V^j~kJ2 b [ y ^f) Y I ' Yl -> ms ^) ( Yl -> ms ^jk) d P 

= V\jk~Y< b (^^P) Y ai ' ms ( Y l\ms (€jk) , 



m=—l 
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which is a well-defined spin quantities, as the inner product (f s (p), Y[. ms (p)) 
yields a well-defined, complex-valued scalar. However we shall not use 
this "Dirac" notation later in this paper, as we hope the meaning of our 
manipulations will remain clear by themselves. 

The absolute value of spin needlets is indeed a well-defined scalar 
function, and this allows to discuss localization properties. In this frame- 
work, the main result is established in [21], where it is shown that for 
any MgN there exists a constant cm > s.t., for every £ e S 2 : 



1 ^ (i +B iJl B s((^))r uniformly in u k) ' (17) 

i.e. the tails decay quasi-exponentially. 

We are now able to focus on the core of this paper, which is related 
to the analysis of spin random fields. As mentioned in the previous dis- 
cussion, we have in mind circumstances where stochastic analysis must 
be developed on polarization random fields {Q±iU}, which are spin 
±2 random functions. 

Hence we shall now assume we deal with random isotropic spin func- 
tions f s , by which we mean that there exist a probability space P), 
such that for all choices of charts Ur, the ordinary random function 
(f s )n, defined on f2 x S 2 , is jointly 3 x B(Ur) measurable, where B(Ur) 
denotes the Borel sigma-algebra on Ur. In particular, for the spin 2 
random function (Q + ill) (p) as for the scalar case, the following repre- 
sentation holds, in the mean square sense ([21]) 



{Q + iU} = a «m;2^;m2 , 



Im 

i.e. 

2 

dp = . 



lim E 



{Q + if}(p)-££ Yl;rra{p) 



1=1 m=—l 



Note that the quantity on the left-hand side is a well-defined scalar, 
for all L. The sequence {ai m 2 = ai m -E + iaim-s} is complex- valued and is 
such that, for all li, / 2 , mi, m 2 , 

Eai irni -Eai 2m2] E = Eai imi] Ba,i 2ni2] E = Eai irni -Eai 2m2; B = Eai irni .Eai 2 „ l2 -B = , 
and 

Eai m -Eai'm';E = Cie$ 1 i $™ , Eai m . B ai'm';B = ClB$l $m , 
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where 



E 



21 + 1 
An 



Cie , Yl 



21 + 1 
An 



Q B < oo . 



The spin (or total) angular power spectrum is defined as 




: Ci — {Cie + Cib} ■ 



In this paper, we shall be dealing with quadratic transforms of ran- 
dom needlet coefficients; as in the earlier works in this area, will use the 
diagram formulae (see for instance [46, 57]) extensively, and we provide 
here a brief overview to fix notation. Denote by H q the q— th order 
Hermite polynomials, defined as 



Diagrams are basically mnemonic devices for computing the moments 
and cumulants of polynomial forms in Gaussian random variables. Our 
notation is the same as for instance in [34, 35], where again these tech- 
niques are applied in a CMB related framework. Let p and / ij = 1 , . . . , p 
be given integers. A diagram 7 of order (h,...,l p ) is a set of points 
{(j J) '■ 1 < j < P, 1 < I < lj} called vertices, viewed as a table 
W = l\ <S> • ■ • <E> l p and a partition of these points into pairs 

{((j, I), (k, s)) : 1 < j < k < p; 1 < I < lj, 1 < s < l k }, 

called edges. We denote by I(W) the set of diagrams of order l p ). 
If the order is l± — • • • — l p — q, for simplicity, we also write I(p, q) 
instead of I(W). We say that: 

a) A diagram has a flat edge if there is at least one pair 

such that i — i'; we write lp for the set of diagrams that have at least 
one flat edge, and lp otherwise. 

b) A diagram is connected if it is not possible to partition the rows 
h ■ • • l p of the table W into two parts, i.e. one cannot find a parti- 
tion K\ U K2 = {1, ■■■iP} that, for each member Vk of the set of edges 
(Vi, V r ) in a diagram 7, either Vk G U, e xi lj , or Vk G ^jeK 2 lj holds; 
we write Ic for connected diagrams, and Ijj otherwise. 

c) A diagram is paired if, considering any two sets of edges Ji) («2, J2)} 
{(^3, 3z){h-> J4)}) then i\ = i 3 implies 12 = i^, in words, the rows are com- 
pletely coupled two by two. 

The following, well-known Diagram Formula will play a key role in 
some of the computations to follow (see [46] and [57]). 



H q {u) = {-iye 



u- 



— e -" 2/2 . 

du 
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Proposition 2 (Diagram Formula) Let (X 1 , ...,X P ) be a centered Gaus- 
sian vector, and let 7^ = E[XiXj],i, j = l,...,p be their covariances, 
Hi x , Hi be Hermite polynomials of degree li, l p respectively. Let L 
be a table consisting of p rows l±, ....l p , where lj is the order of Hermite 
polynomial in the variable Xj . Then 

E[f[H h (X J )]= n ^ m 

3=1 GeI(h,...,l p )l<i<3<P 

Cum(H h (X 1 ),...,Hi p (X p ))= II ^f G) 

Gel c (h,...,l p ) l<i<j<P 

where, for each diagram G, is the number of edges between 

rows li,lj and C-um(iJ/ 1 (X 1 ), Hi (X p )) denotes the p-th order cumu- 
lant. 



5 Spin Needlets Spectral Estimator 

In this section, we shall establish an asymptotic result for the spectral 
estimator of spin needlets in the high resolution sense, i.e. we will inves- 
tigate the asymptotic behaviour of our statistics as the frequency band 
goes higher and higher. We note first, however, one very important is- 
sue. As we mentioned earlier, spin needlet coefficients are not in general 
scalar quantities. It is possible to choose a single chart to cover all points 
other than the North and South Pole; these two points can be clearly 
neglected without any effect on asymptotic results. The resulting spin 
coefficients will in general depend on the chart, and should hence be 
written as {^Rjks} ] however the choice of the chart will only produce 
an arbitrary phase factor exp(is7fc). The point is that, because in this 
paper we are only concerned with quadratic statistics, the phase factor is 
automatically lost and our statistics for the spin spectral estimator will 
be invariant with respect to the choice of coordinates. In view of this, 
from now on we can neglect the issues relative to the choice of charts; 
we will deal with needlet coefficients as scalar-valued complex quantities, 
i.e. we will take the chart as fixed, and for notational simplicity we write 
{Pjks} rather than {(3 R - jks } . 

We begin by introducing some regularity conditions on the polariza- 
tion angular power spectrum Ti, which are basically the same as in [21], 
see also [7], [8] and [32, 18, 33, 38] for closely related assumptions. 

Condition 3 The random field {Q + ill} (p) is Gaussian and isotropic 
with angular power spectrum such that 

Ci = l~ a g(l) > , where Cq 1 < g(l) < c , a > 2 , for all I G N , 
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and for every r e N there exist c r > snc/i t/iat 



|— g(u)\ < c r u~ r , u e (\s\ , oo) . 

Remark 4 TTie condition is fulfilled for instance by angular power spec- 
tra of the form 

r m 

1 lf*F 2 (l) ' 

where Fi(l), F 2 (l) > are polynomials of degree qi, q 2 > 0, j3+q 2 —qi = ex. 
By (16), it is readily seen that 

y^jkV \'k' ^ b (^^f J b \ ^2 E a l;msai';m,'sYh ms (£jfc) Yl';m's (€j'k') = 

i,v V / V / mm , 

because 

Eai- ms ai>- m > s = Eai imi;E ai 2m2 -E + 2Eai imi -Bai 2m2 -E + Eai imi -Eai 2m2 -B = . 



On the other hand, the covariance Cov (/3jk;s, Pjk';s) — EPj k . s {3j k r. s is in 
general non-zero. In view of (16, it is immediate to see that 



\C0V (Pjk;s: Pjk';s)\ 

where 



An 



K ls (p, P ')= Y i ms (p)Y lms (p>) . 



(18) 
(19) 



m=—l 



For k = k! we obtain as a special case from (11) that 



E \Pjk ;s \ 2 = Xjk^b 2 ( 
I ^ 



Bi ) 4tt 
From (18) and (20) we obtain 



(20) 



\Corr ((3 jk ;s, (3jk';t 



a 



(2H-1) 

47T 



(21) 
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The key result for the development of the high-frequency asymptotic 
theory in the next sections is the following uncorrelation result, which 
was provided by [21]; under Condition 3, 

\Corr (pjk-s, Pjk';s) I < — — — ——7m » for all M e N , some C M > 

{l + B3d(Z jk ,Z jk >)} 

(22) 

The analogous result for the scalar case is due to [7], see also [33, 38] 
for some generalizations. We recall also the following inequality ([42], 
Lemma 4-8), valid for some Cm depending only on M, which will be 
used in the following discussion: 



k> L 1 



< c M 



+ £'d(UW} M {! + &d(S jk ,,S jk „)} M {1 + #d(U&k")} Af 

(23) 

In view of (20), let us now denote 



(2/+ 1) 
An 



k k I ^ ' 

= E^ 2 (^)«(2(+i) ■ 

Under Condition 3, it is immediate to see that 

CoB (2-a)j < < C ljB (2-a)j ( 2 4) 

A question of great practical relevance is the asymptotic behaviour of 
Ylk \Pjk;s\ 2 as an estimator for Tj ]S ; for the scalar case, this issue was 
dealt with by [7], where a Functional Central Limit Theorem result is 
established and proposed as a test for goodness of fit on the angular 
power spectrum. In [48] , the needlets estimator was applied to the cross- 
spectrum of CMB and Large Scale Structure data, while [18, 19] have 
considered the presence of missing observations and observational noise, 
establishing a consistency result and providing further applications to 
CMB data. In the spin case, angular power spectrum estimation was 
considered by [21], under the unrealistic assumptions that the spin ran- 
dom field P — Q + iU is observed on the whole sphere and without 
noise. Here we shall be concerned with the much more realistic case 
where some parts of the domain S 2 are "masked" by the presence of 
foreground contamination; more precisely, we assume data are collected 
only on a subset S 2 \ G, G denoting the masked region. In this section, 
we do not consider the presence of observational noise, which shall be 
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dealt with in the following section. In the sequel, for some (arbitrary 
small) constant e > 0, we define G £ = {x e S 2 : d (x, G) < e} . Consider 



r ,w= E M E Wksl (25) 

where 

x tx . 



f3* jk = / P{x)ij jk . s (x)da 



'S 2 \G 

Our aim will be to prove the following 
Theorem 5 Under condition (3), we have 

f * — r 

■•■■ M! ^= ^ d N(0, 1) , as j -> oo 
Proof. The proof will be basically in two steps; define 



F J>G = E Xk 



E i/wi 2 ' ( 26 ) 



which is clearly an unfeasible version of (25), where the (3* k . s have been 
replaced by the coefficients (in the observed region) evaluated without 
gaps. The idea will be to show that 



Var 



,,,^,< Tj. sG > 
r ' sG r ' s -m N(0, 1) , V 
Var{f J , sG ] ^{f* sG } 

j G - Tj. sG 
and — / —> p (J , as j — > oo . 



{^} 



The proof of these three statements is provided in separate Proposi- 
tions below. ■ 

Proposition 6 As j — > oo, under Condition 3 we have 

p p 



{r, ;sG } 



► d iV(0, i) . 



19 



Proof. Notice that 
2 



E h Var (fj., s G^ = Var 



E i". 



jk;s\ 



| E(3j k . s f3jk'- S 

k,k' 



E 62 



e; s ^ „ (2/ + 1) /s 

L// A {t,jk,t,jk') 



Bo 



47T 



By standard manipulations we obtain the upper bound 



E \^^\ 



< CmB 2( - 2 a ^ J E AjfcAjfc' 



1 



sup Aj-fe 

it' 



E X i k E 



[1 + Bid(t jk ,t jk ,)} 
1 



2M 



[1 + c?(^-fc,^fc')] 2M 



in view of (22) (24) and \ jk m B 2j . On the other hand, we also have 
the trivial lower bound 

E \ E ^s~^\ 2 > E \ E ^sJjk~ s \ 2 = ij,E4- > cE A ^ 2(1 ~ a)J ' 



whence we have 



(27) 



By recent results in [45, 47, 44] it suffices to focus on fourth-order cu- 
mulant; the proof that 



Cum a 



J2k \Pjk;s\ 2 - (J2k "V) 



as j — > oo , 



is a standard application of the Diagram Formula, indeed we have 
Cum^ |E \Pjk;s\ 2 - ^E A i fe j 



6 ^ ] EPj krtS Pj krjS EPj k2 - jS Pj k3]S EPj k3 - tS Pj k4]S EPj k4 - jS Pj kl]S 

ki,k2,k^,k4 
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< c (r, ;s ) 4 x 



jk 



sup X jk > 



in view of (22) and (23). Thus the Proposition is established. 
Next we turn to the following 

Proposition 7 As j — > oo, under Condition 3 we have 



Var 



{hs G } 



Proof. Again in view of the Diagram Formula, it is enough to focus on 



Var[ Ifel 2 )-Varl £ \P* k , 



h h' h h> 

rx, jfii rv ■ rv 

Now notice that 

\Ep jk]S p jk/ . :S \ 2 - \Ep* jk . s p* k ,. s \ 2 
= E(3 jk . s (3 jk ,. s (Ef3 jk . s f3 jk i. s - E/3* k . s /3* k ,. s ) 

+ E Pjk;sPjk';s (E(3jk;s(3jk>;s ~ E (3* k . s (3* k , . g ) , 



and 



(28) 



EP jk ; S Pjk';s - E Pjk;sPjk';s = E Pjk;s (Pjk';s ~ Pjk';s) + E Pjk';s (Pjk;s ~ Pj k - S ) 



< {Elf^slT 2 {E\(3 jk , s - ^ ;s | 2 } 1/2 +{i?|/3; fe , ; J 2 } 1/2 {E\J^ S - ^| 2 } 

(29) 

Hence 



21 1/2 



E\P jk -,s-P* k .J 2 <E\ [ P(x)ij jk]S (x)dx) <E\sup{i> jk ., s (x)} [ \P{x)\dx 



< 



sup {ip jk . s (x)} 
xeG 



E{ j \P(x)\dx 



< 



SUp {lpj k ;s(x)} 

xeG 



E 



ldx 



\P(x)\ 2 dx 
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1 2 



SUp {lpjk;s(x)} 

xeG 



< 4tt 
Now recall that 



E 



\P(x)\ 2 dx 



G 



o 



\[l + B3e] 2M 



E\(3 ]k , s \ 2 = O (B-*) , 
whence E\(3* k . s \ 2 = O (B~ a i) , if M > a/2 + 1. Hence, in view of (29) 



# CB {1 - a ' 2 ^ 

\ E Pjk-,sPjk>,s- E (3jk,s(3 jk ',s\ < p - ^-p , 

for some constant C > 0. Also, from (28) and (30) we obtain that 

E (|^/5jfc;su/3jfc'; SU | - \ E Pjk;suPjk';su\ ) 



(30) 



< E (l^/wra + l^;*i.^ri) 

#(l-a/2)j 



< o 



[1 + 5%] M 
< o 



B-ja/2 
[1 + 5^] M 

Cu^/^jk^jk' 



B 3(l-a/2)j 



M 



[i + 5%; 



Ai 



(E* V) O (S^-W) 



Recall from (27) that V"ar ^Efc-£- fc es 2 \G 
Hence for M large enough, that isM>l + o;/2, the statement of the 
Proposition is established. ■ 



Proposition 8 As j — > oo, under Condition 3 we have 

n0 ■ 



T *j;sG ~ T j;sG 



Var 



Proof. We have 





• 












E ^ 




f-{ 






k-4 jk &2\ G e 







which we can expand as follows 
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^E^ fe. - J }{E - S3 

= E [ E Pdk;s (Pjk' ]S - /3* k ,. s ) E(3 jk ,. s (p jk . s - (3* k . s ) 

k,k> 

+ E Pjk;s (Pjk';s ~ P* k f- S ) E(3* k ,. s (P jk;s ~ P* k . s )] 

+ | E E ^ {ftjk;s - P* k , s ) | + | E E(3* k;s (fe - ^) J 

+2 1 E ^ fe- - } I E (fe - ^3 } 

+2 { £ eJl^Ts (p Jk , ]S - p* k , s ) Ep; k , s - 0jQ 



it,*;' 



+2 J £ E(3 jk , s (3* k , s E (p jk . >s - p* k;s ) {(3 jk , s - (3* k , s ) 
I fc,fc' 

Now recall again 
E |fe| 2 , E \(3* k .f < CB-«\ and E |fe - ^J 2 < ^ + 
whence from the same steps as in the previous Proposition, we have 



EPjk;* (Pjk>; S - Pjk'-s) , Ef3* k . s (P jk/ . s - (3* k , s ) < - — p . 
It follows that 

E |E l^l 2 ~ fe-f J ^ [l + Bie} 2M ' 
By arguments in the previous Propositions, we know that 



Var 



E V U 2(1 - a)j ; 



thus the statement is established, provided we take M > 2 + a/2. 
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Remark 9 In general the expression for Var 



{f r)SG },Var{f* ;sG } de 



pends on the unknown angular power spectrum. However, the normal- 
izing factors can be consistently estimated by subsampling techniques, 
following the same steps as in [8]. 

6 Detection of asymmetries 

In this Section, we shall consider one more possible application of spin 
needlets to problems of interest for Cosmology. In particular, a highly 
debated issue in modern Cosmology relates to the existence of "features" , 
i.e. asymmetries in the distribution of CMB radiation (for instance be- 
tween the Northern and the Southern hemispheres, in Galactic coordi- 
nates). These issues have been the object of dozens of physical papers, 
in the last few years, some of them exploiting scalar needlets, see [49]. 

In order to investigate this issue, we shall employ a similar technique 
as [8] for the scalar case. More precisely, we shall focus on the difference 
between the estimated angular power spectrum over two different regions 
of the sky. Let us consider Ai,A 2 , two subsets of S 2 such that A\ n A 2 = 
0; we do not assume that A 1 U A 2 = § 2 , i.e. we admit the presence 
of missing observations. For practical applications, Ai and A 2 can be 
visualized as the spherical caps centered at the north and south pole N, S 



(i.e. A 1 = {x G § 2 : d(x, N) < tt/2} , A 2 = {x G § 2 : d(x, S) < tt/2} , 



but the results would hold without any modification for general subsets 
and could be easily generalized to a higher number of regions. We shall 
then focus on the statistic 



j-sAi 




where 





jk;s 



2 



, some e > . 



We are here able to establish the following 
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Proposition 10 As j — > oo , we have 



( 






-1/2 


Var • 


{^*j;sAi}_ 








-1/2 


\ 


Var - 


[f^ ;Sj42 }_ 



r* — r • 



^iV(0 2 ,/ 2 ) , 



where (02,/ 2 ) are, respectively, the 2x1 vector of zeros and the 2x2 
identity matrix. 

Proof. By the Cramer- Wold device, the proof can follow very much the 
same steps as for the univariate case. We first establish the asymptotic 
uncorrelation of the two components, i.e. we show that 



lim 

Now 



^{ F u}^{f^}]" 1/2 s{(r; ;aAl -r J - i .) (f* ;M2 - r,-„) } = o . 



(31) 



+ { E A * E A 4 E E 

In view of (22) and Proposition 8, we have 



(32) 



|(32)| < (r,,) 2 £ E 



CXjkXjk' 



2M 



CiTj-sf [sup fc A ifc ] 2 _ / R 2(l-a-M)j\ 

- [1 + 2&ef M - 1) [ h 

Thus (31) is established, in view of (27) and Propositions (7), (8). For 
the fourth order cumulant, given any generic constants u, v, we shall 
write 

X = u 



and 



Recall that 



Y = v 



^{ f U}]" 1/2 (^ 2 -r, s ) • 



(33) 
(34) 



C™ 4 (X + Y) — Cum A (X) + C M m 4 (Y) + 4Cwm(X, Y, Y, Y) 
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+6Cum(X, X, Y, Y) + 4Cum(X, X, X, Y) ; 

by results in the previous Section, we have immediately Cum^ (X) , 
Cum^ (Y) — > 0, as j — > oo. On the other hand, in view of Proposition 
8 and the equivalence between convergence in probability and in LP 
for Gaussian subordinated processes (see [30]), we can replace r*. sA . by 

Tj.gAi in (33) and (34), and we have easily 

Cum(X, Y, Y, Y) 



<CB^{Y, s f Y, E 



M 



X- 



fc:C jfe es2\A ! ^ 1 ,..^ fe3 e§2\A| t 1 + B J d(^ jk ,^ jkl )} 
1 

[1 + Bid(£ jha , i jkl )\ M t 1 + Bid(t jks , i jk2 )] M [1 + Bid(Z ik , t jk3 )] M 

cb^-^ (r j;s ) 2 [su Pfc \ 3k f (R -2 (M+m 

[l + 2Bief M ~ 1) [ >' 

Similarly, we have 

Cum{X, X, X, Y), Cum{X, X, Y, Y) < CB~^ M+1 ^. 

Thus the Proposition is established, provided we choose M > 2 + a. 

Remark 11 An obvious consequence of Proposition 10 is 

r* ;sAl - r,*;M 2 %/ V(() ^ 



77ms restt/^ provides the asymptotic justification to implement on polar- 
ization data the same testing procedures as those considered for instance 
by [49] to search for features and asymmetries in CMB scalar data; i.e., 
it is possible to estimate for instance the angular power spectrum on 
the Northern and Southern hemisphere and test whether they are sta- 
tistically different, as suggested by some empirical findings of the recent 
cosmological literature. 

7 Estimation with noise 



In the previous sections, we worked under a simplifying assumption, i.e. 
we figured that although observations on some parts of the sphere were 
completely unattainable, data on the remaining part were available free 
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of noise. In this Section, we aim at relaxing this assumption; in par- 
ticular, we shall consider the more realistic circumstances where, while 
we still take some regions of the sky to be completely unobservable, 
even for those where observations are available the latter are partially 
contaminated by noise. 

To understand our model for noise, we need to review a view basic 
facts on the underlying physics. A key issue about (scalar and polar- 
ized) CMB radiation experiments is that they actually measure radia- 
tion across a set of different electromagnetic frequencies, ranging from 
30 GHz to nearly 900. One of the key predictions of Cosmology, whose 
experimental confirmation led to the Nobel Prize for J.Mather in 2006, 
is that CMB radiation in all its components follows a blackbody pat- 
tern of dependence over frequency. More precisely, the intensity I a is 
distributed along to the various frequencies according to the Planckian 
curve of blackbody emission 



Here, A is a scalar quantity which is the only free parameter in (35), and 
therefore uniquely determines the shape of the curve: we have A = T for 
the traditional temperature data, whereas for polarization measurements 
one can take A — Q, U. Now the point is that, although there are also a 
number of foreground sources (such as galaxies or intergalactic dust) that 
emit radiation on these frequencies; all these astrophysical components 
(other than CMB) do not follow a blackbody curve. 

We shall hence assume that D detectors are available at frequencies 
ui, vd, so that the following vector random field is observed: 



here, both P(x), N v (x) are taken to be Gaussian zero-mean, mean square 
continuous random fields, independent among them and such that, while 
the signal P(x) is identical across all frequencies, the noise N v (x) is not. 
More precisely, we shall assume for noise the same regularity conditions 
as for the signal P, again under the justification that they seem mild and 
general: 

Condition 12 The (spin) random field N v (x) is Gaussian and isotropic, 
independent from P(x) and with total angular power spectrum {Cin} 
such that 

Cin = r J g N (l) > , where c^ N < g N (l) < c 0N , 7 > 2 , I E N , 



Ia(v,P) 



2hu 3 



1 



(35) 




P Vr {x) = P(x) + N Vr (x) ; 
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and for every r G N there exist c r > such that 
d r 

\^9n(u)\ < c rN u' r , u e (\s\ , oo) . 

It follows from our previous assumptions that for each frequency v r 
we shall be able to evaluate 



/ 



P Vr (x)i> jk . s (x)dx =: (3 jk - sr = (3 jk . sP + f3 jk - sNr 

where clearly 



(3jk; S p= / P{x)i) jk . s {x)dx , Pjk-sNr = / N Vr (x)ip jk . s (x)dx . 



Now it is immediate to note that 

|2 



E\Pjk;sr\ — E\Pjk;sP + Pjk;sN r 



— Ef3j k . sP f3j k . sP + E(3j k . sNr (3 jk . sNr + Ef3j k . sNr f3 jk . s p + Ef3j k . sP f3 jk . sNr 

= E\f3j k]sP \ 2 + E\f3j k . sNr \ 2 , 

so that the estimator J2 k \Pjk-,sr\ 2 will now be upward biased. In the 
next subsections we shall discuss two possible solutions for dealing with 
this bias terms, along the lines of ([52]), and we will provide statistical 
procedures to test for estimation bias. We note first that correlation of 
needlet coefficients across different channels are provided by 



E(3j k - sr (3j k '- S r — E(3j k . s p[3j k >. sP + Ef3j k . sNr f3j k >. S N r ■ 

Denote 

k I 

as before, it is easy to obtain that dB^-^i < Yf s < C 2 B^~^ . With 

the same discussion as for (22) provided by [21], we have that, under 
Condition 3 and 12, 

| Corr (p jk;sr , PJ^) | < — , for all M e N . (36) 
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7.1 The Needlet Auto-Power Spectrum Estimator 

In many circumstances, it can be reasonable to assume that the angular 
power spectrum of the noise component, C^v, is known in advance to 
the experimenter. For instance, if noise is primarily dominated by in- 
strumental components, then its behaviour may possibly be calibrated 
before the experimental devices are actually sent in orbit, or otherwise by 
observing a peculiar region where the signal has been very tightly mea- 
sured by previous experiments. Assuming the angular power spectrum 
of noise to be known, the expected value for the bias term is immediately 
derived: 

feT S) 2l + 1 



E\(3 jk . sNr \ 2 = ^2 b2 (-^f)^— c iN r 



whence it is natural to propose the bias-corrected estimator 

f/ P := Jj^2^2 {\Pjk;sr\ 2 - E\(3 jk - sNr \ 2 } 



k 

D 



~ T>^2 ^2 {(& k < sP + Pjk;sN r ) {Pjk;sP + Pjk;sN r ) ~ E \(3jk;sN r ? } 
k 



- \Pjk;sP\ 2 +Jj \ y~]y] {f3jk;sP(3jk;sN r + Pjk;sN r Pjk;sP + [\Pjk;sN r f ~ E\Pjk;sN r \ 2 ]) \ ■ 

k I k r J 

We label the previous statistics the needlet auto-power spectrum es- 
timator (AP, compare [52]). The derivation of the following Proposition 
is rather standard, and hence omitted for brevity's sake. 

Proposition 13 As j — > oo, we have 



3 3 ^ iV(0,l) , 
Var |r/ p } 

where 

Var |r/ P } = 0(£2(l-min(a )7 ))j) _ 

As before, the normalizing variance in the denominator can be con- 
sistently estimated by subsampling techniques, along the lines of [8]. 

It should be noticed that the rate of convergence for |r^ p — r 3 -j = 
(3(£?(i-m»iO>7)b') j s the same as in the noiseless case for 7 > a, whereas 
it slower otherwise, when the noise is asymptotically dominating. The 

"signal-to- noise" ratio Fj/ ^Var |r/ P | is easily seen to be in the or- 
der of S2j-ajy s (i-min(a, 7 ))j = s j(i+min(a ) 7)-a) j w h e nce it decays to zero 
unless a < 7 + 1. 
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7.2 The Needlet Cross-Power Spectrum estimator 

To handle the bias term, we shall pursue here a different strategy than 
the previous subsection, dispensing with any prior knowledge of the spec- 
trum of the noise component. The idea is to exploit the fact that, while 
the signal is perfectly correlated among the different frequency compo- 
nents, noise is by assumption independent. We shall hence focus on the 
needlets cross-angular power spectrum estimator (CP), defined as 

^ ' = D(D — I) ^ ^ ^ k ' ST1 ^ k ' sr2 

k r\^ri 

= D / D _ 1) {Pjk;sP + Pjk;sN ri ) {Pjk;sP + Pjk;sN r2 ) 

k;sp\ £)(£) — l) | {ftjk;sPf3jk;sN r2 + Pjk;sN ri Pjk;sP + Pjk;sN ri Pjk;sNr 2 ) 

k \ k ri^=r2 

In view of the previous independence assumptions, it is then imme- 
diately seen that the above estimator is unbiased for Tj, i.e. 

Eff F = E E i/wi* = E ft2 (#)^- 

k I 

We are actually able to establish a stronger result, namely 
Proposition 14 As j — > oo, we have 



T CP - r 
Far jff^ 



> d 7V(0, 1) , Var jrf p } = o(5 2(1 - min(a ' 7))j ) . 



We omit also this (standard) proof for brevity's sake. We can re- 
peat here the same comments as in the previous subsection, concerning 
the possibility of estimating the normalizing variance by subsampling 
techniques, along the lines of [8], and the roles of a, 7 for the rate of 

convergence |f^ p ~ T j} = 0(-B (1_min(a ' 7))j '). 

7.3 Hausman Test for Noise Misspecification 

In the previous two subsections, we have considered two alternate esti- 
mators for the angular power spectrum, in the presence of observational 
noise. It is a standard result (compare [52]) that the auto-power spec- 
trum estimator enjoys a smaller variance, provided of course that the 
model for noise is correct. Loosely speaking, we can hence conclude 
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that the auto-power spectrum estimator is more efficient when noise is 
correctly specified, while the cross-power spectrum estimator is more 
robust, as it does not depend on any previous knowledge on the noise 
angular power spectrum. An obvious question at this stage is whether 
the previous results can be exploited to implement a procedure to search 
consistently for noise misspecification. The answer is indeed positive, as 
we shall show along the lines of the procedure suggested by [52]. 

Proposition 15 Under Assumptions 3 and 12 , we have 

pC*P _ pAP 

" r ' -,^(0,1), 



where 

Var{Tf ] f-T^} = 0(B^) 

Proof. The proof is again quite standard, and we only need to provide 
the main details. Notice first that 



T f;s~ T tS ~ D(D-l) E Pjk; S r 1 Pjk;sr 2 --p ^ 53 (l/W^ ~ E I Pjk;sN r 1 2 } 

k r\^r 2 k r 

= D f D _ y^A( D ~ l )^2, E \Pjk;sN r \ 2 ~ J3 \Pjk\eri ~ Pjk;sr 2 \ 2 > , 
' k \ r ) 

and applying again the Diagram Formula, we have that 

= D 2 (D — \) 2 ^ ^ \ E (Pjkusn — Pjky,sr 2 ) {Pjk 2 ;sr 3 ~ Pjk 2 ;sr 4 ) \ ■ 

ki,k 2 r\^r 2 r 3 ^r^ 

Similarly to the discussion for (27), we can show that 
Var (rjjf - f f s ) = O (D 2 B^-^) . 
Once again, the next step is to consider the fourth order cumulants, 

Cum 4 I ^2 I 53 \& k -< sr i ~ Pjk;sr 2 \ 2 ~ (D - l)^2E\[3 jk . sNr 

[_ k \r\^r 2 r 
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— 6 ^ ] ^ ] E (Pjki;sri Pjki;sr2) {Pjk 2 ;sr 3 Pjk2;sr4) 

ki,..,k4 r2„=£r2n-i,n=l,..,4 

;sr 3 fijk 2 ;sr i ) {Pjk 3 ;sr 5 Pjk 3 ;sre) E (Pjk 3 ;sr 5 Pjk 3 ;sr 6 ) [ftjk^sry Pjk^srs) 



^jk!^jk2^jk 3 ^jk 4 



X- 



/, , .../, ;4 [1 + d(£jfci> Cjfc 2 )] M t 1 + d(£jk2,£jk 3 )] M 



in view of (36), choosing M > 3. Now it is easy to see that 

whence the Proposition is established, again resorting to results in ([45]) 
■ 

Remark 16 Note that Var jf £f } , Var jf £f } , 2Ccw jf£f , f^f } are 
robust to mis specification of the noise, because Variance and Covariance 
are translation invariant. It follows that the denominator can (once 
again) be consistently estimated by subsampling techniques, as in [8]. 

Under the alternative of noise misspecification, we have easily 

yCP _ yAP 



CP _ yAp\ 

S -)'•<= 



E\/3jk- yS N r \ 2 — ^j;sN, 



where 



8 S 



sjvar {f^-r-} 
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where Tj. sNr is the bias-correction term which is wrongly adopted. The 
derivation of the power properties of this testing procedure is then im- 
mediate. 

As a final comment, we notice that throughout this paper we have 
only been considering estimation and testing for the total angular power 
spectrum C/ = Cie+Cib- The separate estimation of the two components 
(E and B modes) is of great interest for physical applications, and will 
be addressed in future work. 
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